PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Thecc1EG029805t3
Common Name TCM_029805
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HB-other
Protein Properties Length: 1713 aa    MW: 193446 Da    pI: 5.3002
Description HB-other family protein
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Thecc1EG029805t3    genome    CGD       View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      Homeobox    58.8     8.9e-19    29       84     2            57
                      T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
          Homeobox  2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57
                       kR+  t++qle+Le++++ + yps+++r+ L++klgL++rq ++WF+ rR kekk
  Thecc1EG029805t3 29 PKRQMKTPYQLEALEKAYALETYPSEATRAGLSEKLGLSDRQLQMWFCHRRLKEKK 84
                      69*****************************************************8 PP
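
The Start and End values of the signature domain can be checked against the protein sequence directly. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive, as the alignment above suggests. `N_TERMINUS` holds only the first 90 residues copied from the Sequence section of this record, and `slice_domain` is a hypothetical helper name, not part of any PlantTFDB tooling.

```python
# First 90 residues of Thecc1EG029805t3, copied from the Sequence section.
N_TERMINUS = (
    "MDPGSEEENNPSKNPNKNVNSSNEGHVKPK"
    "RQMKTPYQLEALEKAYALETYPSEATRAGL"
    "SEKLGLSDRQLQMWFCHRRLKEKKETPSKK"
)


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# Start 29, End 84 are the Homeobox coordinates listed in the table above.
homeobox = slice_domain(N_TERMINUS, 29, 84)
print(homeobox)
print(len(homeobox))
```

Under that assumption the slice should reproduce the query line of the alignment, which is a quick sanity check that the table columns were read correctly.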

Protein Features
3D Structure
Database           Entry ID            E-value     Start    End     InterPro ID    Description
Gene3D             G3DSA:1.10.10.60    1.0E-18     7        85      IPR009057      Homeodomain-like
SuperFamily        SSF46689            1.41E-16    17       85      IPR009057      Homeodomain-like
PROSITE profile    PS50071             16.212      25       85      IPR001356      Homeobox domain
SMART              SM00389             1.7E-16     27       89      IPR001356      Homeobox domain
Pfam               PF00046             1.7E-16     29       84      IPR001356      Homeobox domain
CDD                cd00086             1.89E-13    30       85      No hit         No description
SMART              SM00571             4.2E-22     512      571     IPR018501      DDT domain
PROSITE profile    PS50827             16.486      512      571     IPR018501      DDT domain
Pfam               PF02791             1.0E-16     513      568     IPR018501      DDT domain
Pfam               PF05066             7.3E-15     694      761     IPR007759      HB1/Asxl, restriction endonuclease HTH domain
Pfam               PF15612             3.0E-8      907      951     IPR028942      WHIM1 domain
Pfam               PF15613             1.5E-12     1092     1164    IPR028941      WHIM2 domain
Gene Ontology
GO Term       GO Category           GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0010228    Biological Process    vegetative to reproductive phase transition of meristem
GO:0005634    Cellular Component    nucleus
GO:0003677    Molecular Function    DNA binding
Sequence
Protein Sequence    Length: 1713 aa
MDPGSEEENN PSKNPNKNVN SSNEGHVKPK RQMKTPYQLE ALEKAYALET YPSEATRAGL  60
SEKLGLSDRQ LQMWFCHRRL KEKKETPSKK PRKGAALPPE SPIDDLHAGP EPDYGSGSGS  120
GSSPYTDSRK LGGSSSRGMT EDVPTARRYY ESQQSIMELR AIACVEAQLG EPLRDDGPML  180
GMEFDPLPPD AFGAIPEPQK RSGHPYESKA YERHDGRSSK AAVRALHEYQ FLPEHASLRS  240
DAYGQVTQSH FHESPVDGAR ARATSFVHGE EPLPRVHGIQ ERESFTNGRL NTQSIGHPVL  300
GSEDSYVLST GQTLNIDADL RNDRKRKSDE NRIAREVEAH ENRIRKELEK LDLKRRKSEE  360
RMRKEMERHA RERRKEEERL VREKQREEER SQREQRREME RREKFLQKEC LRAEKRRQKE  420
ELRREKEAER RRVAMEKATA RKIAKESMDL IEDEQLELME LAAASKGIPS IIHLDHDSLQ  480
NLESFRDSLS LFPPKSVQLK RPFAIQPWID SEENVGNLLM AWRFLITFAD VLRLWPFTLD  540
EFVQAFHDYD SRLLGEIHVA LLKSIIKDIE DVARTPSTGL GMNQYCAANP EGGHPQIVEG  600
AYSWGFDIRN WQRHLNPLTW PEIFRQLAIS AGLGPQLKKR NAAWTFMGDN DEGKGCEDVV  660
STLRNGSAAE NAFVLMREKG LLLPRRSRHR LTPGTVKFAA FHVLSLEGRE GLTVLELADK  720
IQKSGLRDLT TSKTPEASIS VALTRDAKLF ERIAPSTYCV RPAYRKDPTD AEAILAAARK  780
KIRQFENGFL GGEDADEVER DEVERDEESE CDVDEEPEVD DIATPSNANK DADYPKDEVN  840
TCSGSGKVHV STDALNVPSE FDKDFSSFPP NIMKDANGPS NTGQYVAREE MGTGNPDQQN  900
IEIDESKSGE SWIQGLSEGE YSHLSVEERL NALVALIGIA NEGNSIRAVL EDRLEAANAL  960
KKQMWVEAQL DKSRLKEETM VKMDFPSMMG IKAEPQLPNS VVEGSQSPFP AAYNKNDEAS  1020
PSIPDDQKPL LCSQNVQNDL NSYPAERALV LQEASMGPDN FSAQQIGHAS KRSRSQLKSY  1080
IAHRAEEMYV YRSLPLGQDR RRNRYWQFVA SASKNDPCSG RIFVELRDGN WRLIDSEEAF  1140
DTLLTSLDAR GIRESHLRIM LQKIETSFKE NVRRNLQCAR AIGRSGSSTE NEVSELDSSP  1200
DFPASFDSPS SAICGLNFDA LETLPSFKIQ LGRNENEKKL ALKRYQDFQR WIWKECYNSS  1260
TLCAMKYGKK RCVQLLAVCD VCLRSHIPEE MHCGYCHQTF GSVNNSFNFS EHEIQCKENR  1320
KLDTKDTCTI DYSLPLGISL LKSLCALVEV SIPPEALESV WIEGRRKMWG RELNASSSVD  1380
ELLKILTHLE SAIKRDHLLS NFETTKELLG SNLQSESDSS VSVLPWIPET TAAVALRLLE  1440
LDVSIMCVKQ EKVEPSENKE ARAYIKLPSR TSLFIKNKEL ELKELDQDEA MKEENFADMS  1500
HSKRNSYKRG RGGREQGSGR KWQRRASGSR YDTGKRSARE KNNLSFRLKQ QGQRTNGRSS  1560
GRGRRTVRKR AERRAADNTM VARVADVIKP KVSDVRDLDE EWRTEKFRVM QMVNPPDSNS  1620
AEEESDDNAQ GEGYGQGNWD LDYNGASNGW NAEAMEASDE DDDAYEDDNG VEQLGEEDSD  1680
GDLEISDASD VVANKAGNDD GSDLAVSEDY SD*
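
The sequence above is laid out in groups of ten residues with a running position count at the end of each line and a `*` marking the stop. For downstream tools it usually has to be flattened first. The following is a minimal Python sketch of that step; `parse_numbered_block` and `to_fasta` are hypothetical helper names, and only the first two lines of the block are used here as sample input.

```python
import re


def parse_numbered_block(text: str) -> str:
    """Join the residue groups of a numbered sequence block into one string."""
    groups = []
    for line in text.splitlines():
        # Only runs of letters are residues; the running position count,
        # the spacing, and a trailing '*' stop marker are skipped.
        groups.extend(re.findall(r"[A-Za-z]+", line))
    return "".join(groups).upper()


def to_fasta(identifier: str, sequence: str, width: int = 60) -> str:
    """Format a sequence as a FASTA record with fixed-width lines."""
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + identifier + "\n" + "\n".join(lines)


# Sample input: the first two lines of the block above.
block = """\
MDPGSEEENN PSKNPNKNVN SSNEGHVKPK RQMKTPYQLE ALEKAYALET YPSEATRAGL  60
SEKLGLSDRQ LQMWFCHRRL KEKKETPSKK PRKGAALPPE SPIDDLHAGP EPDYGSGSGS  120
"""
fragment = parse_numbered_block(block)
print(to_fasta("Thecc1EG029805t3 (first 120 residues)", fragment))
```

Matching on letters rather than stripping digits keeps the helper indifferent to how the position counts and the stop marker are spaced.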
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      77       84     RRLKEKKE
2      353      374    KRRKSEERMRKEMERHARERRK
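
NLS coordinates can be cross-checked by searching for the motif in the protein sequence. The following is a minimal Python sketch that reports 1-based inclusive positions; the record does not state which coordinate convention the Start and End columns use, so a one-residue offset against the table is possible and worth verifying. `N_TERMINUS` holds only the first 90 residues from the Sequence section, and `locate` is a hypothetical helper name.

```python
# First 90 residues of Thecc1EG029805t3, copied from the Sequence section.
N_TERMINUS = (
    "MDPGSEEENNPSKNPNKNVNSSNEGHVKPK"
    "RQMKTPYQLEALEKAYALETYPSEATRAGL"
    "SEKLGLSDRQLQMWFCHRRLKEKKETPSKK"
)


def locate(sequence: str, motif: str):
    """Return the 1-based inclusive (start, end) of the first match, or None."""
    index = sequence.find(motif)  # 0-based index, or -1 when absent
    if index == -1:
        return None
    return index + 1, index + len(motif)


# Motif of NLS no. 1 from the table above.
print(locate(N_TERMINUS, "RRLKEKKE"))
```

The second motif lies beyond residue 90, so checking it the same way requires the full sequence rather than this excerpt.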
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AJ566576    2e-78      AJ566576.1 Theobroma cacao microsatellite, clone mTcCIR255.
Annotation -- Protein
Source       Hit ID                E-value    Description
Refseq       XP_007025542.1        0.0        Homeodomain-like transcriptional regulator, putative isoform 3
Swissprot    F4HY56                0.0        RLT1_ARATH; Homeobox-DDT domain protein RLT1
TrEMBL       A0A061GM78            0.0        A0A061GM78_THECC; Homeodomain-like transcriptional regulator, putative isoform 3
STRING       POPTR_0011s05660.1    0.0        (Populus trichocarpa)
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G28420.1    0.0        homeobox-1
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]